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such proteins, as well as fragments and 
biologically functional variants thereof. The 
invention also pertains to therapeutics and 
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OSTEOCLAST TRANSPORTER PROTEIN 



Government Support 

This work was funded in part by the NIDDK under Grant Number R01 DK45639. The 
5 United States government may have certain rights to this invention. 

Field of the Invention 

This invention relates to osteoclast transporter genes, proteins coded for by such genes, 
and diagnostics and therapeutics related to medical conditions associated with such genes and 
10 proteins, including osteoporosis and osteopetrosis. 

Background of the Invention 

The remodeling of bone is a dynamic process. Cells continuously lay down and resorb 
bone material An imbalance in the activity of cells that lay down new bone (osteoblasts) and 
15 cells that resorb bone (osteoclasts) can result in serious, and sometimes even fatal, disorders. 

Osteoporosis is a term used for a number of diseases of diverse etiology, all involving a 
reduction in the mass of bone per unit volume. Osteoporosis is the most common of the 
metabolic bone diseases. Twenty-five million people in the United States and more than two 
hundred million people worldwide are affected by osteoporosis. Osteoporosis is frequent among 
20 post-menopausal women and is an important cause of morbidity in the elderly. It commonly 
results in bone fractures, and death can be a frequent occurance in the months following 
fractures, particularly those of the hip in elderly individuals. 

Osteopetrosis is a disorder involving an increase in the mass of bone per unit volume. Its 
incidence is rare compared to osteoporosis, but it typically is life threatening. Despite having 
25 multiple causes, a defect in bone resorption is always the underlying mechanism. In many 
instances, the disorder is inherited as an autosomal recessive trait and involves abnormal 
osteoclast function. Bone marrow transplants from normal donors have been attempted to 
restore normal osteoclast precursor cells, but this therapy has shown only limited success. 

Present treatments for osteoporosis and osteopetrosis are inadequate. 
30 There exists a need to influence favorably the bone remodeling process to treat 

osteoporosis and osteopetrosis. There also exists a need to identify the gene(s) responsible for 
osteopetrosis and to provide a genetic therapy for treating osteopetrosis. 
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An object of the invention is to provide compounds that desirably influence the bone 
remodeling process. 

Another object of the invention is to provide therapeutics for treating osteoporosis. 
Another object of the invention is to provide therapeutics for treating osteopetrosis. 
Still another object of the invention is to provide diagnostics and research tools relating to 
osteoporosis and osteopetrosis. These and other objects will be described in greater detail below. 

Summary of the Invention 

The invention involves in one respect the discovery of osteoclast transporter proteins and 
nucleic acid molecules encoding those proteins. The expression and biological activity of the 
proteins are necessary for normal osteoclast function, and alteration of the expression or 
biological activity of these proteins can be used to influence osteoclast activity and thereby affect 
bone remodeling. In addition, normal osteoclast function can be established in abnormal 
osteoclasts lacking a normal osteoclast transporter protein by supplying to the abnormal 
osteoclast a nucleic acid expressing a functional osteoclast transporter protein. 

The preferred nucleic acids of the invention are homologs and alleles of the nucleic acids 
of SEQ ID NO:l and SEQ ID NO:3. The invention further embraces functional equivalents, 
variants, analogs and fragments of the foregoing nucleic acids and also embraces proteins and 
peptides coded for by any of the foregoing. 

According to one aspect of the invention, an isolated nucleic acid molecule is provided. 
The molecule hybridizes under stringent conditions to a molecule consisting of the nucleic acid 
sequence of SEQ ID NO: 1 or SEQ ID NO:7 and it codes for an osteoclast transporter molecule. 
The invention further embraces nucleic acid molecules that differ from the foregoing isolated 
nucleic acid molecules in codon sequence due to the degeneracy of the genetic code. The 
invention also embraces complements of the foregoing nucleic acids. 

Preferred isolated nucleic acid molecules are those which hybridize under stringent 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID NO:3, 
particularly those comprising the human cDNAs or gene corresponding to SEQ ID NO:3 and the 
cDNAs or gene corresponding to SEQ ID NO:l . 

The invention in another aspect involves expression vectors, and host cells transformed or 
transfected with such expression vectors, comprising the nucleic acid molecules described above. 
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In one embodiment of the invention, the host cell is a hematopoietic osteoclast precursor cell, 
such as a stem cell. 

According to another aspect of the invention, an isolated nucleic acid molecule is 
provided which comprises a unique fragment of SEQ ID NO:l between 12 and 2101 nucleotides 

5 in length or a unique fragment of nucleotides 1-2112 of SEQ ID NO:3 between 12 and 21 1 1 
nucleotides in length, and complements thereof. In one embodiment, the unique fragment is at 
least 1 50 and more preferably at least 200 nucleotides in length. In another embodiment, the 
unique fragment is between 12 and 32 contiguous nucleotides in length. In some embodiments, 
the isolated nucleic acid molecule is a unique fragment of SEQ ID NO:l between 12 and 1974 

1 0 nucleotides in length or a unique fragment of nucleotides 1 20- 1 733 of SEQ ID NO: 1 between 1 2 
and 1612 nucleotides in length. In another embodiment, the isolated nucleic acid molecule is 
selected from the group consisting of a unique fragment of nucleotides 1-1 179 of SEQ ID NO:3 
between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512-21 12 of 
SEQ ID NO:3 between 12 and 599 nucleotides in length. In certain embodiments, the isolated 

15 nucleic acid molecule is a unique fragment of the coding region of the human osteoclast 

transporter gene. Such isolated nucleic acid molecule include unique fragments of nucleotides 
73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides in length. Preferably, unique 
fragments of the coding region include isolated nucleic acid molecules selected from the group 
consisting of unique fragments of nucleotides 73-1 179 of SEQ ID NO:3 between 12 and 1 105 

20 nucleotides in length and unique fragments of nucleotides 1 5 1 2-1 758 of SEQ ID NO:3 between 
12 and 245 nucleotides in length. 

According to another aspect of the invention, isolated polypeptides coded for by the 
isolated nucleic acid molecules described above also are provided as well as functional 
equivalents, variants, analogs and fragments thereof. In one embodiment, the polypeptide is a 

25 human osteoclast transporter protein or a functionally active fragment or variants thereof. 

The invention also provides isolated polypeptides which selectively bind an osteoclast 
transporter protein or fragments thereof. Isolated binding polypeptides include antibodies and 
fragments of antibodies (e.g. Fab, F(ab) 2 , Fd and antibody fragments which include a CDR3 
region which binds selectively to the osteoclast transporter proteins of the invention). Preferred 

30 isolated binding polypeptides are those that bind an extracellular portion of the osteoclast 
transporter proteins of the invention. 



r 
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The invention in another aspect involves a method for decreasing osteoclast activity in a 
subject. An agent that selectively binds to an isolated nucleic acid molecule as described above 
or an expression product thereof is administered to a subject in need of such treatment, in an 
amount effective to decrease osteoclast activity in the subject. In one embodiment, the agent 
5 selectively binds to an extracellular domain of the expression product. Preferred agents are 
modified antisense nucleic acids and polypeptides. 

The invention is a further aspect provides for the use of agents which bind to the 
foregoing nucleic acids and expression products thereof in the preparation of a medicament. 
Preferred agents are modified antisense nucleic acids and polypeptides. 
10 The invention also contemplates gene therapy for osteopetrosis, wherein defective stem 

cells of a donor are genetically engineered to include an isolated nucleic acid expressing a 
functional osteoclast transporter protein. The cells then are returned to the donor. 

In the same manner as described above kidney cell tubule activity may be modulated 
using the methods and products of the invention. 

15 

Brief Description of the Drawing 

FIG. 1 is a graph depicting hydrophobicity vs. amino acid for the mouse osteoclast 
transporter protein. 

20 Detailed Description of the Invention 

The present invention in one aspect involves the cloning of a gene encoding an osteoclast 
transporter protein. The sequence of the gene (from mouse) is presented as SEQ ID NO: I . and 
the predicted amino acid sequence of this gene*s protein product is presented as SEQ ID NO:2. 
The mRNA transcript is about 2.0 kb in length. It was obtained from mouse kidney tubule cells. 

25 A partial clone of the apparent human homolog was identified in GenBank, accession no. 

H20345, and is presented as SEQ ID NO:7. The partial clone is available from American Type 
Culture Collection Depository. Rockville, Maryland. The human cDNA encoding the osteoclast 
transporter protein was isolated by a standard hybridization protocol using probes consisting of 
the 5' end of the mouse gene (nucleotides 45-357 of SEQ ID NO: 1 ) and the partial human gene 

30 sequence (SEQ ID NO:7). The sequence of the full length human cDNA is presented as SEQ ID 
NO:3, and the predicted amino acid sequence of the human protein product is presented as SEQ 
ID NO:4. 
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The mouse gene maps to the proximal portion of chromosome 19, in a region to which 
the osteosclerosis (oc) mutation has been assigned previously. The phenotype of oc/oc mutant 
mice includes osteopetrosis, and the osteoclasts of these mutant mice, although present, appear to 
be nonfunctional. The osteoclast transporter protein of the present invention is not expressed in 

5 oc/oc mutant mice, although Southern analysis indicates that the transporter locus is 

unrearranged and intact in these mutant mice. The osteoclast transporter protein of the present 
invention, however, is expressed in normal osteoclasts, and it is believed that hereditary 
osteopetrosis results from mutations to this gene, which impair osteoclast function. 

The gene is not expressed in any other normal tissue tested, except in kidney tubule cells 

10 where its function is believed to be redundant. In particular. Northern analysis demonstrated that 
the gene is not expressed in normal skeletal muscle, pancreas, lung, heart, brain, stomach, spleen, 
salivary gland, thymus, liver, large intestine or small intestine tissue. 

Searches of GenBank for similar related proteins shows that this protein shares some very 
limited, localized homology and sequence motifs to known proteins with transport functions. 

15 The protein of the invention, however, does have a length and tertiary structures similar to 
known transporters. 

Specifically, it appears to be a member of a large family of genes with transport functions 
called the "major facilitator superfamily" by Marger and Saier, which include a number of uni-, 
sym- and antiporters specific for sugars, organic acids and drugs. The common structural motif 

20 of this family are 12 transmembrane alpha helices, and often includes a central cytoplasmic loop 
between the 6th and 7th membrane-spanning domains. The hydrophobicity plot for the protein 
of the present invention is shown in Fig. 1 . Each asterisk indicates a potential membrane 
spanning hydrophobic domain. There appears to be an extracellular loop between the first and 
second hydrophobic domains, an intra-cytoplasmic loop between the hydrophobic domains 6 and 

25 7, and a 3' intra-cytoplasmic tail. The 5' end also appears to be interna] within the cytoplasm. 
The transporter also contains amino-acid motifs found in this class of molecules, 
including a sequence, D-R-F-G-R-K (SEQ ID NO:5), similar to a D-R/K-X-R-R/K sequence 
(SEQ ID NO:6) after the 2nd membrane domain, and a P-E-S/T sequence found after the 12th 
transmembrane domain. For genes with known function, the transporter is most closely related 

30 to the polyspecific rat cation transporter Octl (30% amino acid identity and 50% similarity), 
which has been shown to transport a variety of organic cationic drugs (Grundemann et aL, 
Nature, Vol. 372, December 1994.) 
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The invention thus involves osteoclast transport proteins, genes encoding those proteins, 
functional modifications and variants of the foregoing, useful fragments of the foregoing, as well 
as therapeutics and diagnostics relating thereto, 

Homologs and alleles of the osteoclast transporter genes of the invention can be identified 
5 by conventional techniques. Thus, an aspect of the invention is those nucleic acid sequences 
which code for osteoclast transporter proteins and which hybridize to a nucleic acid molecule 
consisting of SEQ ID NO: 1 or SEQ ID NO:3, under stringent conditions. The term "stringent 
conditions"' as used herein refers to parameters with which the art is familiar. More specifically, 
stringent conditions, as used herein, refers to hybridization at 65°C in hybridization buffer (3.5 x 

10 SSC, 0.02% FicolL 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 2.5mM 

NaH 2 P0 4 (pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium ' chloride/0. 15M sodium citrate, 
ph7; SDS is sodium dodecyl sulphate; and EDTA is ethylcncdiaminetetracetic acid. After 
hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC at room 
temperature and then at 0.1 x SSC/0.1 x SDS at 65°C. 

15 There are other conditions, reagents, and so forth which can used, which result in a 

similar degree of stringency. The skilled artisan will be familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permit the clear identification of homologs and alleles 
of osteoclast transporter proteins of the invention. The skilled artisan also is familiar with the 

20 methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NOs: 1 or 3 and SEQ ID NOs:2 or 4, 
respectively, in some instances will share at least 50% nucleotide identity and/or at least 65% 

25 amino acid identity and in still other instances will share at least 60% nucleotide identity and/or 
at least 75% amino acid identity. Watson-Crick complements of the forgoing nucleic acids also 
are embraced by the invention. 

In screening for osteoclast transporter protein family members, a Southern blot may be 
performed using the foregoing conditions, together with a radioactive probe. After washing the 

30 membrane to which the DNA is finally transferred, the membrane can be placed against x-ray 
film to detect the radioactive signal. 
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The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 

5 the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in vivo, to incorporate a serine residue. Similarly, nucleotide sequence triplets which 
encode other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT 
(proline codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG 
and ACT (threonine codons); AAC and AAT (asparaginc codons); and ATA, ATC and ATT 

10 (isoleucine codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The invention also provides isolated unique fragments of SEQ ID NO:l, SEQ ID NO:3, 
or compliments of SEQ ID NO: 1 or SEQ ID NO:3. A unique fragment is one that is a 

15 'signature' for the larger nucleic acid. It, for example, is long enough to assure that its precise 
sequence is not found in molecules outside of the osteoclast transporter protein family as defined 
by claim 1 . Unique fragments can be used as probes in Southern blot assays to identify family 
members or can be used in amplification assays such as those employing PCR. As known to 
those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain 

20 uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. 
Unique fragments also can be used to produce fusion proteins for generating antibodies or for 
generating immunoassay components. Likewise, unique fragments can be employed to produce 
fragments of the osteoclast transporter protein such as only the extracellular portion, useful, for 
example, in immunoassays or as a competitive inhibitor of the substrate of the osteoclast 

25 transporter protein in therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of the osteoclast transporter proteins of the invention, 
particularly for therapeutic purposes as described in greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l and SEQ 

30 ID NO:3, and their complements, will require longer segments to be unique while others will 
require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Virtually any segment of 
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SEQ ID NO: 1 or SEQ ID NO:3, or their complements, that is 1 8 or more nucleotides in length 
will be unique. Those skilled in the art arc well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the sequence 
of interest from non-family members. A comparison of the sequence of the fragment to those on 
5 known data bases typically is all that is necessary, although in vitro confirmatory hybridization 
and sequencing analysis may be performed. 

As mentioned above, the invention embraces antisense oligonucleotides that selectively 
bind to a nucleic acid molecule encoding an osteoclast transporter protein, to decrease osteoclast 
activity. This is desirable in virtually any medical condition wherein a reduction in osteoclast 

10 activity is desirable, including when a reduction in bone loss or an increase in bone mass is 
desired. By decreasing the osteoclast activity in a subject, bone remodeling thus can be 
favorably affected. Antisense molecules, in this manner, can be used to slow down or arrest the 
loss in bone mass occurring with certain forms of osteoporosis and may even result in an 
increase in bone mass in circumstances where an increase in bone mass is desirable. 

15 As used herein, the term "antisense oligonucleotide" or '"antisense" describes an 

oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mKN A transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRN A. The 

20 antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene. Those skilled in the art will recognize that the 
exact length of the antisense oligonucleotide and its degree of complementarity with its target 
will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 

25 be constructed and arranged so as to bind selectively with the target under physiological 

conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell under physiological conditions. Based upon SEQ ID NO:l and SEQ ID NO:3, 
or upon allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily 
choose and synthesize any of a number of appropriate antisense molecules for use in accordance 

30 with the present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target. Most preferably, the antisense 
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oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal or 
5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
5 addition, 3'-untranslated regions or telomerase sites may be targeted. Targeting to mRNA 

splicing sites has also been used in the art but may be less preferred if alternative mRNA splicing 
occurs. In addition, the antisense is targeted, preferably, to sites in which mRNA secondary 
structure is not expected (sec, e.g., Sainio et al., Cell Mol. Neurobiol. 14(5):439-457 (1994)) and 
at which proteins are not expected to bind. Finally, although, SEQ ID NOs: 1 and 3 disclose 

10 cDNA sequences, one of ordinary skill in the art may easily derive the genomic DNAs 

corresponding to the cDNAs of SEQ ID NOs: 1 and 3. Thus, the present invention also provides 
for antisense oligonucleotides which are complementary to the genomic DNAs corresponding to 
SEQ ID NOs:i and 3. Similarly, antisense to allelic or homologous cDNAs and genomic DNAs 
are enabled without undue experimentation. 

15 In one set of embodiments, the antisense oligonucleotides of the invention may be 

composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. That 
is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out manually 

20 or by an automated synthesizer. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target bul which enhance 
their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

25 The term "modified oligonucleotide" as used herein describes an oligonucleotide in 

which (1) at least two of its nucleotides are covalently linked via a synthetic internucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide 
and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with 
nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 

30 internucleoside linkages include phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkvlphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates. and carboxymethyl esters. 
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The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular weight 
organic groups other than a hydroxy! group at the 3' position and other than a phosphate group at 
5 the 5' position. Thus modified oligonucleotides may include a 2-O-alkylated ribose group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. The 
present invention, thus, contemplates pharmaceutical preparations containing modified antisense 
molecules that arc complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding osteoclast transporter proteins, together with pharmaceutically acceptable 
10 carriers. 

Anlisense oligonucleotides may be administered as part of a pharmaceutical composition 
or medicament. Such a pharmaceutical composition or medicament may include the antisense 
oligonucleotides in combination with any standard physiologically and/or pharmaceutically 
acceptable carriers which are known in the art. The compositions should be sterile and contain a 

15 therapeutically effective amount of the antisense oligonucleotides in a unit of weight or volume 
suitable for administration to a patient. The term "pharmaceutically acceptable 1 ' means a non- 
toxic material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 

20 characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art. 

The invention also involves expression vectors coding for osteoclast transporter proteins 
and fragments and variants thereof and host cells containing those expression vectors. Virtually 

25 any cells, prokaryotic or eukaryotic, which can be transformed with heterologous DNA or RNA 
and which can be grown or maintained in culture, may be used in the practice of the invention. 
Examples include bacterial cells such as K coli and mammalian cells such as mouse, hamster, 
pig, goat, primate, etc. They may be of a wide variety of tissue types, including mast cells, 
fibroblasts, oocytes and lymphocytes, and they may be primary cells or cell lines. Specific 

30 examples include CHO cells and COS cells. Cell-free transcription systems also may be used in 
lieu of cells. In gene therapy applications, human hematopoietic cells that are precursors of 
osteoclasts are contemplated. 
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As used herein, a "vector" may be any of a number of nucleic acids into which a desired 
sequence may be inserted by restriction and ligation for transport between different genetic 
environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. 

5 A cloning vector is one which is able to replicate in a host cell, and which is further characterized 
by one or more endonuclease restriction sites at which the vector may be cut in a determinable 
fashion and into which a desired DNA sequence may be ligated such that the new recombinant 
vector retains its ability to replicate in the host cell. In the case of plasmids, replication of die 
desired sequence may occur many times as the plasmid increases in copy number within the host 

10 bacterium or just a single time per host before the host reproduces by mitosis. In the case of 
phage, replication may occur actively during a lytic phase or passively during a lysogenic phase. 
An expression vector is one into which a desired DNA sequence may be inserted by restriction 
and ligation such that it is operably joined to regulatory sequences and may be expressed as an 
RNA transcript. Vectors may further contain one or more marker sequences suitable for use in 

15 the identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose 
activities are detectable by standard assays known in the art (e.g. B-galactosidase or alkaline 
phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, 

20 hots, colonies or plaques. Preferred vectors are those capable of autonomous replication and 
expression of the structural gene products present in the DNA segments to which they are 
operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or transcription 

25 of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5' regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the 

30 ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 



WO 97/42321 PCT/US97/078S6 

-12- 

capable of effecting transcription of that DNA sequence such that the resulting transcript might 
be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribing and 
5 5' non-translating sequences involved with the initiation of transcription and translation 

respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, 
such 5' non-transcribing regulatory sequences will include a promoter region which includes a 
promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences 
may also include enhancer sequences or upstream activator sequences as desired. The vectors of 

10 the invention may optionally include 5' leader or signal sequences, 5* or 3'. The choice and design 
of an appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are commercially 
available and known to those skilled in the art. See Sanbrook et ah. Molecular Cloning: A 
Laboratory Manual . Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are 

15 genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
the osteoclast transporter protein or fragment or variant thereof. That heterologous DNA (RNA) 
is placed under operable control of transcriptional elements to permit the expression of the 
heterologous DNA in the host cell. In still another aspect of the invention, a defective osteoclast 
or precursor thereof is treated with DNA in a manner to promote via homologous recombination 

20 intracellular! y the correction of a defective osteoclast transporter gene. 

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogcn, San Diego, CA) that contain a selectable marker such as a gene that 
confers G418 resistance (which facilitates the selection of stably transfected cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 

25 expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an 
Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of plasmid as a 
multicopy extrachromosomal element. 

mRNA expression and transporter function can be tested using these vectors in a wide 
variety of mammalian cell lines. Preferred systems include cells derived from kidney tubules, 

30 including MDCK and 1MCD-3 cells (both available from ATCC). 

A variety of systems for expression of proteins in bacterial, yeast, mammalian, or insect 
cells have been described and are commercially available. Preferred systems include the 
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Glutathione-S-transferase (GST) Gene Fusion system available from Pharmacia Biotech, 
Piscataway, NJ. In this system a piasmid is constructed containing the protein sequence of 
interest (in this case, the transporter including the first extracellular domain) inserted in frame 
downstream of the 25 kDa GST domain from S. japonicum . Expression of the fusion protein can 
5 be induced in transfected bacterial cells and the fusion protein purified by affinity 

chromatography using Glutathione Sepharose 4B. Cleavage of the desired peptide from the GST 
sequences is achieved using a site specific protease whose recognition sequence is located 
immediately upstream from the cloning site. An alternative system which is desirable since it 
maintains eucaryotic-specific functions such as glycosylation is recombination into baculovirus. 
10 Standard protocols exist (c.f O'Reilly et al., Baculovirus Expression Vectors: A laboratory 
Manual, IRL/Oxford University Press, 1992) and vectors, cells, and reagents are commercially 
available. 

The invention also involves polypeptides which bind to osteoclast transporter proteins 
and in certain embodiments preferably to the extracellular domain of the proteins. Such binding 

15 partners can be used in screening assays to detect the presence or absence of the osteoclast 
transporter protein and in purification protocols to isolate osteoclast transporter proteins. Such 
binding partners also can be used to inhibit the native activity of the osteoclast transporter protein 
by binding to the extracellular domain of such proteins. Likewise, such binding partners can be 
used to selectively target drugs, toxins or other molecules to osteoclasts. In this manner, 

20 osteoclast activity may be selectively increased by drugs that would enhance osteoclast activity 
or selectively impaired by drugs that would inhibit osteoclast activity (e.g. toxins, antisense, 
etc.). In this manner, bone remodeling may be desirably affected. 

The invention, therefore, involves antibodies or fragments of antibodies having the ability 
to selectively bind to osteoclast transporter proteins, and preferably to the extracellular domain 

25 thereof. Antibodies include polyclonal and monoclonal antibodies, prepared according to 

conventional methodology. Antibodies were raised according to standard procedures. A fusion 
protein was prepared using the pGEX-4T-2 vector (Pharmacia, Piscataway, NJ). The fusion 
protein contains a glutathione-S-transferase (GST) fragment joined to the terminal 84 amino 
acids of the osteoclast transporter. The fusion protein was expressed and injected into rabbits to 

30 produce antibodies as rabbit immune serum. The antibodies so generated detected a protein of 
molecular weight which corresponds to the osteoclast transporter in tissue prepared from normal 
mouse kidneys. The protein is significantly reduced or absent in kidney tissue prepared from 
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oc/oc mutant mice, consistent with the absence of detectable osteoclast transporter mRNA by 
Northern blot in these mice. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 

5 W.R. ( 1 986) The Experimental Foundations of M odern Immunology Wiley & Sons, Inc., New 
York; Roitt, I. (1991) JEssential Immunology . 7th Ed., Blackwell Scientific Publications, 
Oxford). The pFc' and Fc regions, for example, are effectors of the complement cascade but arc 
not involved in antigen binding. An antibody from which the pFc' region has been ertzymatically 
cleaved, or which has been produced without the pFc 1 region, designated an F(ab') 2 fragment, 

10 retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 
the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further, Fab fragments consist of a covalently bound antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 

15 determinant of antibody specificity (a single Fd fragment may be associated with up to ten 
different light chains without altering antibody specificity) and Fd fragments retain epitope- 
binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
complementarity determining regions (CDRs), which directly interact with the epitope of the 

20 antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, 
in general. Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four framework regions (FR1 through FR4) separated 
respectively by three complementarity determining regions (CDR1 through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely 

25 responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 
the epitopic specificity of the original antibody. This is most clearly manifested in the 
development and use of "humanized" antibodies in which non-human CDRs are covalently 

30 joined to human FR and/or Fc/pFc' regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/04381 teaches the production and use of 
humanized murine RS V antibodies in which at least a portion of the murine FR regions have 
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been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab') 3 , Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 

5 and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 
human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which the FR and/or 
CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDR1 arid/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 

10 human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDR1 and/or 
CDR2 regions have been replaced by homologous human or non-human sequences. 1Tie present 
invention also includes so-called single chain antibodies. Thus, the invention involves 
polypeptides of numerous size and type that bind specifically to osteoclast transporter proteins. 
These polypeptides may be derived also from sources other than antibody technology. For 

15 example, such polypeptide binding agents can be provided by degenerate peptide libraries which 
can be readily prepared in solution, in immobilized form or as phage display libraries. 
Combinatorial libraries also can be synthesized of peptides containing one or more amino acids. 
Libraries further can be synthesized of peptoids and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 

20 according to the invention. Briefly, one prepares a phage library (using e.g. ml 3. fd, or lambda 
phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent a completely degenerate or biased array. One then can select phage- 
bearing inserts which bind to the osteoclast transporter protein. This process can be repeated 
through several cycles of reselection of phage that bind to the osteoclast transporter protein. 

25 Repeated rounds lead to enrichment of phage bearing particular sequences. DNA sequence 

analysis can be conducted to identify the sequences of the expressed polypeptides. The minimal 
linear portion of the sequence that binds to the osteoclast transporter protein can be determined. 
One can repeat the procedure using a biased library containing inserts containing part or all of the 
minimal linear portion plus one or more additional degenerate residues upstream or downstream 

30 thereof. Thus, the osteoclast transporter molecule of the invention, an extracellular domain 
thereof, or the like, can be used to screen peptide libraries, including phage display libraries, to 
identify and select peptide binding partners of the extracellular portion of the osteoclast 
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transporter protein of the invention. Such molecules can be used, as described, for screening 
assays, for diagnostic assays, for purification protocols, for interfering directly with the 
functioning of osteoclasts by binding to the osteoclast transporter protein or for targeting drugs, 
toxins and/or labeling agents (e.g. radioisotopes, fluorescent molecules, etc.) to osteoclasts 
5 and/or osteoclast transporter proteins. Drug molecules that would affect osteoclast activity and 
toxin molecules that would disable or destroy osteoclast cells arc known to those skilled in the 
art and arc commercially available. For example, the immunotoxin art provides examples of 
toxins which are effective when delivered to a cell by an antibody or fragment thereof. 
Examples of toxins include ribosome-damaging toxins derived from plants or bacterial such as 

10 ricin, abrin, saporin, Pseudomonas endotoxin, diphtheria toxin, A chain toxins, blocked ricin, etc. 
Osteoclast activity can be assayed both in vitro and in vivo. Heterogeneous culture 
systems in which bone marrow derived cells are co-cultured with osteoblast ceils in the presence 
of 1,25-dihydroxy vitamin D3 will generate mature osteoclasts, whose activity can be quantitated 
by measuring their ability to form resorption pits on dentine slices (Takahashi et al., 

15 Endocrinology ; Vol. 122, No. 4, 1988). In mammals, decreased osteoclast function results in the 
clinical condition of osteopetrosis. In rodents, this can be readily scored by virtue of the fact 
that, in this case, incisors fair to erupt after birth. Thus, rodent neonates can be treated with 
compositions of the invention and the effect on osteoclast activity can be determined by scoring 
incisor eruption. The degree of osteopetrosis can be more accurately quantitated by 

20 morphometrical analysis of the bone marrow space (Boyce et al., ./. 'Clin. Invest.. Vol. 90 r 1 992). 
When used therapeutically, the compounds of the invention are administered in 
therapeutically effective amounts. In general, a therapeutically effective amount means that 
amount necessary to delay the onset of, inhibit the progression of, or halt altogether the particular 
condition being treated. Therapeutically effective amounts specifically will be those which 

25 desirably influence osteoclast activity. When it is desired to decrease osteoclast activity, then 
any inhibition of osteoclast activity is regarded as a therapeutically effective amount. When it is 
desired to increase osteoclast activity, then any enhancement of osteoclast activity is regarded as 
a therapeutically effective amount. Generally, a therapeutically effective amount will van' with 
the subject's age, condition, and sex, as well as the nature and extent of the disease in the subject, 

30 all of which can be determined by one of ordinary skill in the art. The dosage may be adjusted 
by the individual physician or veterinarian, particularly in the event of any complication. A 
therapeutically effective amount typically varies from 0.01 mg/kg to about 1000 mg/kg, 
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preferably from about 0.1 mg/kg to about 200 mg/kg and most preferably from about 0.2 mg//kg 
to about 20 mg/kg, in one or more dose administrations daily, for one or more days. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 
oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 
capacity (see, for example, Sciarra and Cutie, "Aerosols, 1 ' in Remington'? Pharmaceutical 
Sciences . 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishes, electrolyte replenishers (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available documents. In general, it involves introduction in 
vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of 
the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of 
the gene is under operable control of regulatory elements which permit expression of the gene in 
the genetically engineered cell(s). Numerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
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are described in PCX application WO95/00654. In vivo gene therapy using vectors such as 
adenovirus also is contemplated according to the invention. 

Thus, defective osteoclasts or precursors thereof are provided with a non-defective 
nucleic acid encoding an osteoclast transporter protein. In particular, such gene therapy is 
5 appropriate for hereditary forms of osteopetrosis involving defective osteoclast transporter 
protein genes. For example, primary human blood cells which are precursors of osteoclasts can 
be obtained from the bone marrow of a subject who is a candidate for such gene therapy. 
Candidates can be identified by screening for abnormal osteoclast function that results from a 
defective osteoclast transporter protein. Then, such cells can be genetically engineered ex vivo 

10 with DNA (RMA) encoding a normal osteoclast transporter protein. The genetically engineered 
cells then arc returned to the patient. 

Thai the transporter can be used to correct the defect in the clinical conditions of 
osteopetrosis in which it has been inactivated is demonstrated using a mouse model system. The 
osteosclerosis (oc) mutant mouse fails to express the transporter and has osteopetrosis. The 

15 transporter is introduced into a vector such as pXT] (Strategene, La Jolla, CA) that, when 
transfected into an appropriate packaging cell line, will generate a higher titer of replication- 
defective retrovitral particles carrying the transporter under the regulatory control of a thymidine 
kinase promoter. Hematopoietic progenitor cells from oc mutant mice are prepared and, using 
well-documented techniques of infection and selection, stem cells carrying this vector are 

20 selected after coculture with the retrovirus. Re-population of lethally irradiated oc mutant mice 
with this transfected population is accomplished using documented protocols of hematopoietic 
transplantation, and the degree of amelioration of the osteopetrosis phenotype then is determined 
by radiography and morphometrical analysis. 

While the invention has been described with respect to certain embodiments, it should be 

25 appreciated that many modifications and changes may be made by those of ordinary skill in the 
art without departing from the spirit of the invention. It is intended that such modification, 
changes and equivalents fall within the scope of the following claims. 
A Sequence Listing is provided, followed by what is claimed: 
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(1) GENERAL INFORMATION : 

5 (i) APPLICANT: 

(A) NAME: BRIGHAM AND WOMEN'S HOSPITAL, INC. 

(B) STREET: 75 FRANCIS STREET 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

10 (E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02115 

(i) APPLICANT/ INVENTOR: 

(A) NAME: BEIER, DAVID R. 
j 5 (B) STREET: 39 ARLINGTON ROAD 

(C) CITY: BROOKLINE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02146 

20 

(i) APPLICANT/ INVENTOR: 

(A) NAME: BRADY, KEVIN P. 

(B) STREET: 3N BENNET COURT 

(C) CITY: BOSTON 

25 (D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02113 



30 



<ii) TITLE OF INVENTION: OSTEOCLAST TRANSPORTER 
(iii) NUMBER OF SEQUENCES : 7 
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(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WOLF, GREENFIELD & SACKS, P.C. 

(B) STREET: 600 ATLANTIC AVENUE 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 02210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/647,397 

(B) FILING DATE: 09-MAY-1996 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Gates, Edward R. 

(B) REGISTRATION NUMBER: 31,616 

(C) REFERENCE/DOCKET NUMBER: B0801/7048WO 

( ix ) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 120.. 1733 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TGOGAGCGTG CIACTACAGC AGCTOCTGAA CCIAGACAGG CACGGCAACT GCTGCATCCA 60 

25 

GCTCCAGCCC AACTGAATCC AGCTCCAACC ACCAGTTTTG GTTCATCITG CCTGGTGCC 



ATG ACC TTC TCC GAG ATT CTG GAC CCT GTT GGA AGC ATG GGC CCC TTC 
Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 

c 10 15 

30 1 5 



119 



167 



CAG TAC 



CTG CAT GTG ACC TTG CTG GCC CTC CCA ATC CTC GGA ATA GCC 



215 
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Gin Tyr Leu His Val Thr Leu Leu Ala Leu Pro lie Leu Gly He Ala 



20 



25 30 



AAC CAC AAC TTG CTA CAG ATC TTC ACA GCC ACC ACC CCT GAC CAC CAC 
5 Asn His Asn Leu Leu Gin lie Phe Thr Ala Thr Thr Pro Asp His His 

40 45 



35 



TGT CGC CCG CCC CCC AAC GCC TCP CTA GAG CCC TGG GTA CTC CCC TTG 
- Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
50 55 60 

GGC CCA AAC GGG AAG CCT GAG AAG TGT CTC CGC TTC GTG CAT CTG CCA 
Gly Pro Asn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 

75 80 



65 70 



AAC GCC AGT CTT CCC AAT GAC ACC CAG GGG GCC ACC GAG CCA TGC TTG 
Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 
85 



90 95 



20 GAT GGC TGG ATC TAC AAC AGC ACC AGA GAC ACC ATT GTG ACA GAG TGG 
Asp Gly Trp lie Tyr Asn Ser Thr Arg Asp Thr lie Val Thr Glu Trp 



100 



105 H° 



GAC TTG GTA TGC GGC TCC AAC AAA CTG AAG GAG ATG GCA CAG TCA GTC 
25 Asp Leu Val Cys Gly Ser Asn Lys Lau Lys Glu Met Ala Gin Ser Val 



115 



120 



125 



TTC ATG GCA GGT ATA CTG GTT GGA GGA CCT GTG TTT GGA GAA CTG TCA 
Phe Met Ala Gly He Leu Val Gly Gly Pro Val Phe Gly Glu l*u Ser 
30 130 135 140 

GAC AGG TTT GGC CGC AAG CCC ATC CTG ACC TGG AGC TAT CTC TTG CTG 



263 



311 



359 



407 



455 



503 



551 



599 
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Asp Arg Phe Gly Arg Lys Pro He Leu Thr Trp Ser Tyr Leu Leu Leu 
145 150 155 160 

GCA GCC ACT GGC TCC ACT GCT GCC TTC AGC CCC AGC CTC ACT GTC TAT 647 
5 Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 

165 170 175 

ATG' ATC TTC CGA TTC CTG TGT GGC TGC AGC ATC TCG GGC ATT TCT CTG 695 
Met He Phe Arg Phe Leu Cys Gly Cys Ser He Ser Gly He Ser Leu 
10 180 185 190 

AGC ACC ATT ATC TTG AAT GTG GAA TOG GTA CCC ACC TCC ACG COG GCC 743 
Ser Thr He He Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg Ala 
195 200 205 

15 

ATC TCA TCA ACA ACT ATT GGG TAC TGC TAC ACC ATT GOT CAA TTC ATT 791 
He Ser Ser Thr Thr He Gly Tyr Cys Tyr Thr He Gly Gin Phe He 
210 215 220 

20 CTG CCT GGC CTG GCC TAT GCC GTT CCT CAG TGG CGC TGG CTA CAG TTG 839 
Leu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 
225 230 235 240 

TCC GTG TCT GCT GCC TTC TTC ATC TTC TCC TTG TTG TCC TGG TGG GTA 887 
25 Ser Val Ser Ala Ala Phe Phe He Phe Ser Leu Leu Ser Trp Trp Val 

245 250 255 

CCA GAG TCC ATA CGC TGG CTG GTT CTG TCT GGA AAA TTC TCA CGA GCT 935 
Pro Glu Ser He Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 
30 260 265 270 

CTG AAG ACA CTC CAA CGT GTG GCT ACC TTC AAC GGC AAG AAG GAG GAA 983 
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Leu Lys Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 
275 280 285 



GOG GAA AAG CTC ACT GTG GAG GAG CTG AAG TTC AAC TTG CAG AAG GAC 1031 
5 Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 
290 295 300 

ATC ACC TCA GCC AAG GTC AAA TAT GGC TTA TCT GAC TTG TTC CGA GTG 1079 
lie Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
10 305 310 315 320 

TCC ATC CTG CGC CGT GTG ACC TTC TGT CTC TCT CTG GCC TGG TIT GCT 1127 
Ser lie Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 
325 330 335 

15 

ACT GGC TIT GCC TAC TAC AGT TTG GCT ATG GGA GTA GAA GAA TTT GGA 1175 
Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 
340 345 350 

20 GTC AAC ATC TAC ATA CTC CAG ATC ATC TTC GGT GGG GTT GAC ATT CCC 1223 
Val Asn lie Tyr lie Leu Gin lie lie Phe Gly Gly Val Asp lie Pro 
355 360 365 

GCC AAG TTC ATC ACA ATC CTC TCC ATA AGT TAT CTG GGC CGG CGC ATC 1271 
25 Ala Lys Phe lie Thr lie Leu Ser lie Ser Tyr Leu Gly Arg Arg lie 
370 375 380 

ACT CAG GGC TTC CTC CTG ATC CTG GCA GGA GTG GCC ATC CTG GCC CTC 1319 
Thr Gin Gly Phe Leu Leu lie Leu Ala Gly Val Ala He Leu Ala Leu 
30 385 390 395 400 

ATC TTT GTG TCT TCA GAA ATG CAG CTC TTG AGA ACA GCA CTG GCT GTA 1367 
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Ile Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Val 
405 410 415 

TIT GGG AAG GGA TGC CTG TCT GGC TCC TTC AGC TGC CTC TTC CTC TAC 1415 
5 Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 
420 425 430 

ACA AGT GAG CTC TAC CCT ACA GTC CTC AGG CAA ACA GGT ATG GGT ATC 1463 
Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly lie 
10 435 440 445 

AGT AAC ATA TGG GCT CGA GTG GGA AGT ATG ATA GCC CCA CTG GTG AAA 1511 

Ser Asn He Trp Ala Arg Val Gly Ser Met He Ala Pro Leu Val Lys 

450 455 460 

15 

ATC ACG GGA GAA CTG CAG CCC TTC ATC CCT AAT GTC ATC TTT TGG ACC 1559 

He Thr Gly Glu Leu Gin Pro Phe He Pro Asn Val He Phe Trp Thr 

465 470 475 480 

20 ATG ACT CPA CTG GGA GGC AGT GCT GCC TTC TTT CTG CTT GAG ACC CTC 1607 
Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 
485 490 495 

AAT CGG CCC TTA CCA GAA ACT ATC GAG GAC ATA CAA GAC TGG TAC CAG 1655 
25 Asn Arg Pro Leu Pro Glu Thr He Glu Asp He Gin Asp Trp Tyr Gin 
500 505 510 

CAA ACC AAG AAA ACA AAG CAG GAG CCA GAA GCA GAA AAG GCA TCC CAG 1703 
Gin Thr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin 
30 515 520 525 



ACA ATC CCG CTG AAG ACT GGT GGA CCC TAGCTAAGAA CAACAGAATC 



1750 
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Thr lie Pro Leu Lys Thr Gly Gly Pro 
530 535 



CTCTTTCCTG CCTTCCAGAG ACTGATCCCA AGCAGGGCCC TTCCAAGGCT ATTCGAGCAC 1810 

5 

CITAGGGGTT GGGTGGAGCC CCAGCTGGCT CCATGCTCTC AGAACAAAGA CTTCTGAGAG 1870 

TTCAGCAAAA GTGTTITACC TTCACCACCT CCACCGTAGC CXZACAACCCA GACCTGGCCT 1930 

10 GTTCACAGCC CTAGCCATAC TCACTCCTGC ACTCATCCTC CCTGCAACCC AGGCCCTGCC 1990 

ATTTTTCTCT ACCCTCTTTC TATTGGCCAT TTCCTCCATT GTCCCA(TCTC CATTTCCCTT 2050 

TGAGATTCCC TGGCAC7ITCT AATGGTTTCC TCTTACCCTC CCCCTCGTGC CG 2102 

15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 537 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly Pro Phe 
15 10 15 

30 

Gin Tyr Leu His Val Thr Leu Leu Ala Leu Pro lie Leu Gly lie Ala 
20 25 30 
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Asn His Asn Leu Leu Gin He Phe Thr Ala Thr Thr Pro Asp His His 
35 40 45 

Cys Arg Pro Pro Pro Asn Ala Ser Leu Glu Pro Trp Val Leu Pro Leu 
50 55 60 

Gly Pro Asn Gly Lys Pro Glu Lys Cys Leu Arg Phe Val His Leu Pro 
65 70 75 80 

Asn Ala Ser Leu Pro Asn Asp Thr Gin Gly Ala Thr Glu Pro Cys Leu 
85 90 95 

Asp Gly Trp He Tyr Asn Ser Thr Arg Asp Thr He Val Thr Glu Trp 
100 105 110 

Asp Leu Val Cys Gly Ser Asn Lys Leu Lys Glu Met Ala Gin Ser Val 
115 120 125 

Phe Met Ala Gly He Leu Val Gly Gly Pro Val Phe Gly Glu Leu Ser 
130 135 140 

Asp Arg Phe Gly Arg Lys Pro He Leu Thr Trp Ser Tyr Leu Leu Leu 
145 150 155 160 

Ala Ala Ser Gly Ser Ser Ala Ala Phe Ser Pro Ser Leu Thr Val Tyr 
165 170 175 

Met He Phe Arg Phe Leu Cys Gly Cys Ser He Ser Gly He Ser Leu 
180 185 190 



Ser Thr He He Leu Asn Val Glu Trp Val Pro Thr Ser Thr Arg Ala 
195 200 205 
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He Ser Ser Thr Thr He Gly Tyr Cys Tyr Thr He Gly Gin Phe He 
210 



215 220 



Leu Pro Gly Leu Ala Tyr Ala Val Pro Gin Trp Arg Trp Leu Gin Leu 
5 225 



230 235 240 



Ser Val Ser Ala Ala Phe Phe He Phe Ser Leu Leu Ser Trp Trp Val 
245 250 255 

,0 Pro Glu Ser He Arg Trp Leu Val Leu Ser Gly Lys Phe Ser Arg Ala 
260 265 270 

Thr Leu Gin Arg Val Ala Thr Phe Asn Gly Lys Lys Glu Glu 



Leu Lys 

275 
15 

Gly Glu Lys Leu Thr Val Glu Glu Leu Lys Phe Asn Leu Gin Lys Asp 
290 



280 285 



295 300 



He Thr Ser Ala Lys Val Lys Tyr Gly Leu Ser Asp Leu Phe Arg Val 
20 305 



310 315 320 



Ser He Leu Arg Arg Val Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 
325 330 335 



Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 
340 



25 

345 350 



30 



Val Asn He Tyr He Leu Gin He He Phe Gly Gly Val Asp He Pro 
355 360 365 

Ala Lys Phe He Thr He Leu Ser He Ser Tyr Leu Gly Arg Arg He 
370 375 380 
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Thr Gin Gly Phe Leu Leu He Leu Ala Gly Val Ala He Leu Ala Leu 
385 390 400 



He Phe Val Ser Ser Glu Met Gin Leu Leu Arg Thr Ala Leu Ala Val 
5 405 410 415 

Phe Gly Lys Gly Cys Leu Ser Gly Ser Phe Ser Cys Leu Phe Leu Tyr 
420 425 430 

10 Thr Ser Glu Leu Tyr Pro Thr Val Leu Arg Gin Thr Gly Met Gly He 
435 440 445 

Ser Asn He Trp Ala Arg Val Gly Ser Met He Ala Pro Leu Val Lys 
450 455 460 

15 

He Thr Gly Glu Leu Gin Pro Phe He Pro Asn Val He Phe Trp Thr 
465 470 475 480 

Met Thr Leu Leu Gly Gly Ser Ala Ala Phe Phe Leu Leu Glu Thr Leu 
20 485 490 495 

Asn Arg Pro Leu Pro Glu Thr He Glu Asp He Gin Asp Trp Tyr Gin 
500 505 510 

25 Gin Thr Lys Lys Thr Lys Gin Glu Pro Glu Ala Glu Lys Ala Ser Gin 
515 520 525 

Thr He Pro Leu Lys Thr Gly Gly Pro 
530 535 

30 

(2) INFORMATION FOR SEQ ID NO: 3: 
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Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

<vi) ORIGINAL SOURCE : 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 1758 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
GAATTCOGGT GATCACCAGC CCCATOGGAT CCAGACCCGG CCACCAGCTC TGGCTOGTCT 60 
TGCCCCAGTG CC ATG ACC TTC TOG GAG ATC CTG GAC CGT GTC GGA AGC 108 



Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser 



1 



5 



10 



ATG GGC CAT TTC CAG TTC CTG CAT GTA GCC ATA CTG GGC CTC COG ATC 



156 



Met Gly His Phe Gin Phe Leu His Val Ala lie Leu Gly Leu Pro He 



15 



20 



25 



CTC AAC ATG GCC AAC CAC AAC CTG CTG CAG ATC TTC ACA GCC GCC ACC 



204 
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Leu Asn Met Ala Asn His Asn Leu Leu Gin lie Phe Thr Ala Ala Thr 
30 35 40 



CCT GTC CAC CAC TCT CGC CCG CCC CAC AAT GCC TCC ACA GGG OCT TGG 
5 Pro Val His His Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp 
45 50 55 60 



252 



GTG CTC CCC ATG GGC CCA AAT GGG AAG CCT GAG AGG TCC CTC CGT TIT 
Val Leu Pro Met Gly Pro Asn Gly Lys Pro Glu Arg Cys Leu Arg Phe 
10 65 70 75 



300 



15 



GTA CAT CCG CCC AAT GCC AGC CTC CCC AAT GAC ACC CAG AGG GCC ATG 348 
Val His Pro Pro Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met 
80 85 90 

GAG CCA TGC CTC GAT GGC TGG GTC TAC AAC AGC ACC AAG GAC TCC ATT 396 
Glu Pro Cys Leu Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser lie 
95 100 105 



20 GTG ACA GAG TGG GAC TTG GTG TGC AAC TCC AAC AAA CTG AAG GAG ATG 444 
Val Thr Glu Trp Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met 
110 115 120 



GCC CAG TCT ATC TTC ATG GCA GGT ATA CTG ATT GGA GGG CTC GTG CTT 492 
25 Ala Gin Ser He Phe Met Ala Gly He Leu He Gly Gly Leu Val Leu 
125 130 135 140 

GGA GAC CTG TCT GAC AGG TTT GGC CGC AGG CCC ATC CTG ACC TGC AGC 540 
Gly Asp Leu Ser Asp Arg Phe Gly Arg Arg Pro He Leu Thr Cys Ser 
30 145 150 155 



TAC CTG CTG CTG GCA GCC AGC GGC TCC GGT GCA GCC TTC AGC CCC ACC 



588 
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Tyr Leu Leu Leu Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr 
160 165 170 

TTC CCC ATC TAC ATG GTC TTC GGC TTC CTG TGT GGC TIT GGC ATC TCA 
5 Phe Pro He Tyr Met Val Phe Arg Phe Leu Cys Gly Phe Gly He Ser 
175 180 185 

GGC ATT ACC CTG AGC ACC GTC ATC TTG AAT GTG GAA TGG GTG CCT ACC 
Gly He Thr Leu Ser Thr Val lie Leu Asn Val Glu Trp Val Pro Thr 
,0 190 195 200 

CGG ATG CGG GCC ATC ATG TCG ACA GCA CTC GGG TAC TGC TAC ACC TIT 
Arg Met Arg Ala lie Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe 

mn 215 220 

205 210 

15 

GGC GAG TTC ATT CTG CCC GGC CTG GCC TAC GCC ATC CCC CAG TGG OGT 
Gly Gin Phe lie Leu Pro Gly Ueu Ala Tyr Ala lie Pro Gin Trp Arg 
225 230 235 

20 TGG CTG CAG TTA ACT GTG TCC ATT CCC TTC TTC CTC TTC TTC CTA TCA 
Trp Leu Gin Leu Thr Val Ser lie Pro Phe Phe Val Phe Phe Leu Ser 
240 245 250 

TCC TGG TGG ACA CCA GAG TCC ATA CGC TGG TTG GTC TTG TCP GGA.AAG 
25 Ser Trp Trp Thr Pro Glu Ser He Arg Trp Leu Val Leu Ser Gly Lys 
255 



260 265 



TCC TOG AAG GCC CTG AAG ATA CTC CGG CGG GTT GGC TGT CTT CAA TGG 
Ser Ser Lys Ala Leu Lys lie Leu Arg Arg Val Gly Cys Leu Gin Trp 



275 



280 



30 270 

CAA GAA GGA AGA AGG AGA AAG CTC AGC TTG GAA GAG CTC AAA CTC AAC 



636 



684 



732 



780 



B28 



876 



924 



972 
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Gln Glu Gly Arg Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn 

285 290 295 300 

CTG CAG AAG GAG ATC TCC TTG GCC AAG GCC AAT TAC ACC GCA AGT GAC 1020 
5 Leu Gin Lys Glu lie Ser Leu Ala Lys Ala Asn Tyr Thr Ala Ser Asp 

305 310 315 

CTG TTC CGG ATA CCC ATG CTG CGC CGC ATG ACC TTC TGT CTT TCC CTG 1068 
Leu Phe Arg lie Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu 
10 320 325 330 

GCC TGG TIT GCT ACC GGT TTT GCC TAC TAT ACT TTG GCT ATG GGT GTG 1116 
Ala Trp Phe Ala Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val 
335 340 345 

15 

GAA GAA TTT GGA GTC AAC CTC TAC ATC CTC CAG ATC ATC TTT GGT GGG 1164 
Glu Glu Phe Gly Val Asn Leu Tyr lie Leu Gin lie lie Phe Gly Gly 
350 355 360 

20 GTC GAT GTC CCA GCC AAG TTC ATC ACC ATC CTC TCC TTA AGC TAC CTG 1212 
Val Asp Val Pro Ala Lys Phe lie Thr lie Leu Ser Leu Ser Tyr Leu 
365 370 375 380 

GGC CGG CAT ACC ACT CAG GCC GCT GCC CTG CTC CTG GCA GGA GGG GCC 1260 
25 Gly Arg His Thr Thr Gin Ala Ala Ala Leu Leu Leu Ala Gly Gly Ala 

385 390 395 

ATC TTG GCT CTC ACC TTT GTG CCC TTG GAC TTG CAG ACC GTG AGG ACA 1308 
lie Leu Ala Leu Thr Phe Val Pro Leu Asp Leu Gin Thr Val Arg Thr 
30 400 405 410 



GTA TTG GCT GTG TTT 



GGG AAG 



GGA TGC CPA TCC AGC TCC TTC AGC TGC 



1356 
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Val Leu Ala Val Phe Gly Lys Gly Cys Leu Ser Ser Ser Phe Ser Cys 
415 



420 425 



CTC TTC CTC TAC ACA AGT GAA TTA TAC CCC ACA GTC ATC AGG CAA ACA 
5 Leu Phe Leu Tyr Thr Ser Glu Leu Tyr Pro Thr Val He Arg Gin Thr 
430 435 440 

GGT ATG GGC GTA AGT AAC CTG TGG ACC CGC GTG GGA AGC ATG GTG TCC 

Gly Met Gly Val Ser Asn Leu Trp Thr Arg Val Gly Ser Met Val Ser 

4S0 455 460 

10 445 450 

CCG CTG GTG AAA ATC AGG GGT GAG GTA CAG CCC TTC ATC CCC AAT ATC 
Pro Leu Val Lys lie Thr Gly Glu Val Gin Pro Phe lie Pro Asn lie 
465 



470 475 



15 



ATC TAC GGG ATC ACC GCC CTC CTC GGG GGC AGT GCT GCC CTC TTC CTG 
lie Tyr Gly He Thr Ala Leu Leu Gly Gly Ser Ala Ala Leu Phe Leu 
480 



485 490 



20 CCT GAG ACC CTG AAT CAG CCC TTG CCA GAG ACT ATC GAA GAC CTG GAA 
Pro Glu Thr Leu Asn Gin Pro Leu Pro Glu Thr lie Glu Asp Leu Glu 
495 500 505 

AAC TGG TCC CTG CGG GCA AAG AAG CCA AAG CAG GAG CCA GAG GTG GAA 
25 Asn Trp Ser Leu Arg Ala Lys Lys Pro Lys Gin Glu Pro Glu Val Glu 



510 



515 



520 



AAG GCC TCC CAG AGG ATC CTG TAC AGC CTC ACG GAC CAG GCC TGG GCT 
Lys Ala Ser Gin Arg He Leu Tyr Ser Leu Thr Asp Gin Ala Trp Ala 
30 525 

CCA GCT GAG GAC AAC GGA ACC CCC TTT CCC TGC CCT CCA GAG ACT GAT 



1404 



1452 



1500 



1548 



1596 



1644 



1692 



1740 
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Pro Ala Glu Asp Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp 
545 550 555 

CCT AGC CAG GCA CCT TAGGAGTATA GGGAGGCCCC ATATAGGTCC ATCCTCCTAG 1795 
5 Pro Ser Gin Ala Pro 
560 

GATGAAGCCT TCTGAGAGCT TGGTGAAGGT GTCTCCATCA CCACCACCAG AGCCTCCTGC 1855 
10 CCAGCCCTGG CCAGTTCAAA GGTTCAGCCA TCCCTGCCCT TGTTCTCCCT GCAACCCAGG 1915 
CCCTGCCATT CTTCTGTCTA GCCCTTCCCC ACTGGCCACC TTCCCCCACT GTCCOGGTCC 1975 
TCTTCCCCTG AGGTCCCCTG ATATCCCCTG GCTGAGTCCT AACAAGACTG AGTCTTAACA 2035 

15 

AGATGAGAAG TCCTCCCCTT CTTGCCTCCC ACACTTTTCT TTGATGGGAG GTTTCAATAA 2095 
ACAGOGATAA GAACTCTAAA AAAAAAAAAA ACCGGAATTC 2135 

20 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Phe Ser Glu lie Leu Asp Arg Val Gly Ser Met Gly His Phe 
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1 5 10 15 

Gin Phe Leu His Val Ala He Leu Gly Leu Pro He Leu Asn Met Ala 
20 25 30 

Asn His Asn Leu Leu Gin He Phe Thr Ala Ala Thr Pro Val His His 
35 40 45 

Cys Arg Pro Pro His Asn Ala Ser Thr Gly Pro Trp Val Leu Pro Met 
50 55 60 

Gly Pro Asn Gly Lys Pro Glu Arg Cys Leu Arg Phe Val His Pro Pro 
65 . 70 75 80 

Asn Ala Ser Leu Pro Asn Asp Thr Gin Arg Ala Met Glu Pro Cys Leu 
85 90 95 

Asp Gly Trp Val Tyr Asn Ser Thr Lys Asp Ser He Val Thr Glu Trp 
100 105 HO 

Asp Leu Val Cys Asn Ser Asn Lys Leu Lys Glu Met Ala Gin. Ser He 
115 120 125 

Phe. Met Ala Gly He Leu He Gly Gly Leu Val Leu Gly Asp Leu Ser 
130 135 * 40 

Asp Arg Phe Gly Arg Arg Pro He Leu Thr Cys Ser Tyr Leu Leu Leu 
145 150 155 160 

Ala Ala Ser Gly Ser Gly Ala Ala Phe Ser Pro Thr Phe Pro He Tyr 
165 170 175 
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Met Val Phe Arg Phe Leu Cys Gly Phe Gly He Ser Gly He Thr Leu 
180 185 190 

Ser Thr Val He Leu Asn Val Glu Trp Val Pro Thr Arg Met Arg Ala 
5 195 200 205 

He Met Ser Thr Ala Leu Gly Tyr Cys Tyr Thr Phe Gly Gin Phe He 
210 215 220 

10 Leu Pro Gly Leu Ala Tyr Ala He Pro Gin Trp Arg Trp Leu Gin Leu 
225 230 235 240 

Thr Val Ser He Pro Phe Phe Val Phe Phe Leu Ser Ser Trp Trp Thr 
245 250 255 

15 

Pro Glu Ser lie Arg Trp Leu Val Leu Ser Gly Lys Ser Ser Lys Ala 
260 265 270 

Leu Lys He Leu Arg Arg Val Gly Cys Leu Gin Trp Gin Glu Gly Arg 
20 275 280 285 

Arg Arg Lys Leu Ser Leu Glu Glu Leu Lys Leu Asn Leu Gin Lys Glu 
290 295 300 

25 He Ser Leu Ala Lys Ala Asn Tyr Thr Ala Ser Asp Leu Phe Arg He 
305 310 315 320 

Pro Met Leu Arg Arg Met Thr Phe Cys Leu Ser Leu Ala Trp Phe Ala 
325 330 335 

30 

Thr Gly Phe Ala Tyr Tyr Ser Leu Ala Met Gly Val Glu Glu Phe Gly 
340 345 350 
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val Asn Leu Tyr He Leu Gin lie lie Phe Gly Gly Val Asp Val Pro 
355 



360 365 



Ala Lys Phe lie Thr lie Leu Ser Leu Ser Tyr Leu Gly Arg His Thr 
370 375 380 

Thr Gin Ala Ala Ala Leu Leu Leu Ala Gly Gly Ala lie Leu Ala Leu 

ion 395 400 

385 390 

,0 Thr Phe Val Pro Leu Asp Leu Gin Thr Val Arg Thr Val Leu Ala Val 

405 410- 415 

Phe Gly Lys Gly Cys Leu Ser Ser Ser Phe Ser Cys Leu Phe I*u Tyr 
420 425 430 

15 

Glu Leu Tyr Pro Thr Val He Arg Gin Thr Gly Met Gly Val 



Thr Ser 



435 



440 



445 



Ser Asn Leu Trp Thr Arg Val Gly Ser Met Val Ser Pro Leu Val Lys 



20 4 5 0 4 5 5 



lie Thr Gly Glu Val Gin Pro Phe lie Pro Asn lie lie Tyr Gly lie 

475 480 



465 



470 



Ala Leu Leu Gly Gly Ser Ala Ala Leu Phe Leu Pro Glu Thr Leu 
485 



25 Thr 

490 495 



Asn Gin Pro Leu Pro Glu Thr lie Glu Asp Leu Glu Asn Trp Ser Leu 



500 



505 



510 



30 



Arg Ala Lys Lys Pro Lys Gin Glu Pro Glu Val Glu Lys Ala Ser Gin 
515 520 " 5 
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Arg He Leu Tyr Ser Leu Thr Asp Gin Ala Trp Ala Pro Ala Glu Asp 
530 535 540 

Asn Gly Thr Pro Phe Pro Cys Pro Pro Glu Thr Asp Pro Ser Gin Ala 
5 545 550 555 560 



Pro 



10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

25 (v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Asp Arg Phe Gly Arg Lys 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

15 (iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 
20 (A) NAME/KEY: Region 

(B) LOCATION : 2 

(D) OTHER INFORMATION : /note- "Xaa at position 2 is Arg or 
Lys" 

25 (ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION : 5 

(D) OTHER INFORMATION : /note- "Xaa at position 5 is Arg or 
Lys" 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Asp Xaa Xaa Arg Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO:7: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
!0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

AAGTTCATCA CCATCCTCTC CTTAAGCTAC CTGGGCCGGC ATACCACTCA GGCCGCIGCC 60 

25 TGCTCCTGGC AGGAGGGGCC ATCTTGGCTC TCACCITTTG CCCTTGGACT TGCAGACOGT 120 

GAGACAGTAT TGGCTGTGTT TGGGAAGGGA TGCCTATCCA GCTCCITCAG CTGCCTCTTC 180 

CTCTACACAA GTGAATTATA CCCCACAGTTC ATCAGGCAAA CAGGTATGGG CGTAAGTAAC 240 

30 

CTGTGGACCC GCGTGGGAAG CATGGTGTCC CGCTGGTGAA AATCAOGGGT GAGGTACAGC 300 



WO 97/42321 



PCI7US97/07856 



-42- 



CCTTCATCCC CAATATCATC TACGGGAT 



328 
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CLAIMS 

1. An isolated nucleic acid molecule 

(a) which hybridizes under stringent conditions to a molecule consisting of the nucleic 
5 acid sequence of SEQ ID NO: 1 or SEQ ID NO:7 and which codes for an osteoclast transporter 

molecule, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 

10 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
hybridizes under stringent conditions to a molecule consisting of the nucleic acid sequence of 
SEQIDNO:3. 

15 3. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO:3. 

4. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists essentially of SEQ ID NO: 1 . 

20 

5. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 
fragment of SEQ ID NO:l between 12 and 2101 nucleotides in length, (b) a unique fragment of 
nucleotides 1-2112 of SEQ ID NO:3 between 1 2 and 2111 nucleotides in length, (c) 
complements of "(a)" and (d) complements of "(b)". 

25 

6. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of SEQ ID NO:l between 12 and 1974 nucleotides in length. 



30 



7. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of nucleotides 120-1733 of SEQ ID NO: 1 between 1 2 and 1612 nucleotides 
in length. 



.% 
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8. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 1 -1 1 79 of SEQ ID 
NO:3 between 12 and 1 178 nucleotides in length and a unique fragment of nucleotides 1512- 
2112 of SEQ ID NO:3 between 12 and 599 nucleotides in length. 

5 

9. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of nucleotides 73-1758 of SEQ ID NO:3 between 12 and 1684 nucleotides 
in length. 

10 10. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is selected from the group consisting of a unique fragment of nucleotides 73-1 1 79 of SEQ ID 
NO:3 between 12 and 1 105 nucleotides in length and a unique fragment of nucleotides 1512- 
1758 of SEQ ID NO:3 between 12 and 245 nucleotides in length. 

15 11. The isolated nucleic acid molecule of any of claims 5- 1 0, wherein the isolated nucleic 
acid molecule is selected from the group consisting of a unique fragment of at least 14 
contiguous nucleotides, a unique fragment of at least 15 contiguous nucleotides, a unique 
fragment of at least 16 contiguous nucleotides, a unique fragment of at least 17 contiguous 
nucleotides, a unique fragment of at least 1 8 contiguous nucleotides, a unique fragment of at 

20 least 20 contiguous nucleotides and a unique fragment of at least 22 contiguous nucleotides. 

12. The isolated nucleic acid molecule of any of claims 5-10, wherein the isolated nucleic 
acid molecule consists of between 12 and 32 contiguous nucleotides. 

25 13. A host cell transformed or transfected with an expression vector comprising the isolated 
nucleic acid molecule of any of claims 1 , 2, 3 or 4 operably linked to a promoter. 

14. An isolated polypeptide coded for by the isolated nucleic acid molecule of any of claims 
1,2, 3or4. 

30 

15. An isolated polypeptide selectively binding a protein coded for by the isolated nucleic 
acid molecule of any of claims 1 , 2, 3 or 4 . 



WO 97/42321 PCT/US9 7/07856 

-45- 

16. The isolated polypeptide of claim 15, wherein the isolated polypeptide is an Fab or F(ab) 
fragment of an antibody. 

17. The isolated polypeptide of claim 15 wherein the isolated polypeptide is a fragment of an 
5 antibody, the fragment including a CDR3 region selective for the protein. 

1 8. The isolated polypeptide of claim 1 5, wherein the isolated polypeptide is a monoclonal 
antibody. 

10 19. The isolated polypeptide of claim 15, 16 or 17, wherein the isolated polypeptide 
selectively binds an extracellular portion of the protein. 

20. A method for decreasing osteoclast activity in a subject comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
15 isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease osteoclast activity in said subject. 

21 . A method as claimed in claim 20, wherein the agent is a modified nucleic acid. 

20 22. A method as claimed in claim 20, wherein the agent is a polypeptide. 

23. The use of an agent that selectively binds to an isolated nucleic acid molecule of claim 1 , 
or an expression product thereof, in the preparation of a medicament. 

25 24. The use as claimed in claim 23, wherein the agent is a modified nucleic acid. 

25. The use as claimed in claim 23, wherein the agent is a polypeptide. 
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